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HENRY TOWSNER 

Abstract. The use of tools from analysis to approach problems in 
graph theory has become an active area of research. Usually such meth- 
ods are applied to problems involving dense graphs and hypergraphs; 
here we give the an extension of such methods to sparse but pseudoran- 
dom hypergraphs. We use this framework to give a proof of hypergraph 
removal for sub-hypergraphs of sparse random hypergraphs. 



1. Introduction 

In this paper we attempt to bring together two recent trends in extremal 
graph theory: the study of "sparse random" analogs of density theorems, and 
the use of methods from analysis and logic to handle complex dependencies 
of parameters. 

To illustrate these methods, we will prove a version of the Hypergraph 
Removal Lemma for dense sub-hypergraphs of sparse but sufficiently pseu- 
dorandom hypergraphs. The original removal theorem was Rusza and Sze- 
meredi's Triangle Removal Lemma [27j) which states: 



Theorem 1.1. For any every e > 0, there is a 5 > such that whenever 
G Q (^) is a graph with at most 6\V\^ triangles, there is a set C C G with 
\C\ < e\V\'^ such that G\C contains no triangles at all. 

This result was later extended to graphs other than triangles [8], and 
ultimately to hypergraphs [ini[l5l[25] . All these arguments depend heavily 
on the celebrated Szemeredi Regularity Lemma [29], and its generalization, 
the hypergraph regularity lemma |15^ l26j. (Recently, Fox [9] has given a 
proof of graph removal without the use of the regularity lemma, which gives 
better bounds as a result.) 

There has been a growing interest in analytic approaches to graph theory. 
Probably the most widely studied approach is the method of graph limits and 
graphons introduced by Lovasz and coauthors [2ll23p24j . Related but distinct 
approaches have been studied by Hrushovski, Tao, and others p^[T3l[T6]. 
Analytic proofs of regularity and removal lemmas have been giving using 
all these methods [7l|30l[311|3l] . These techniques obtain a correspondence 
between a sequence of arbitrarily large finite graphs on the one hand, and 
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some sort of infinitary structure on the other. Statements about density 
fit naturally in these frameworks since the normalized counting measure 
on large finite graphs corresponds to an ordinary measure on the infinitary 
structure. 

In this paper, we describe a similar correspondence which applies to sub- 
hypergraphs of sparse, pseudorandom hypergraphs. In the finite setting, 
the natural replacement for the normalized counting measure is the count- 
ing measure normalized by the ambient hypergraph. This introduces new 
complications in the infinitary world: we end up with a natural measure on 
sets of fe-tuples which is not a genuine product measure. (This perspective 
on the problem was suggested to us by Hrushovski.) In place of a single 
measure, we end up with a family of measures, and the pseudorandomness 
from the finitary setting is used to ensure that this family of measures obeys 
certain compatibility properties. 

We use this method to give an analytic proof of sparse hypergraph removal. 
Our proof is heavily inspired by the combinatorial proof of triangle removal 
in sparse graphs i20,!21). Very recently, combinatorial proofs of sparse hy- 
pergraph removal (51 llll[28] have also been given by several authors. 

Our approach to hypergraph removal depends heavily on the use of the 
Gowers uniformity (semi)norms |14] . As Conlon and Gowers point out [5], 
such an approach cannot hope to give optimal bounds, and, relatedly, de- 
pends on a much stronger notion of pseudorandomness than strictly needed. 
We stick to this method both because we believe these norms are interest- 
ing in their own right, and because we believe it illustrates the analytic 
approach to sparse hypergraphs more clearly than an attempt to derive op- 
timal bounds would. 

In |13j . Isaac Goldbring and the author proposed a general framework 
for handling analytic arguments of this type, which we called approximate 
measure logic. In this paper, there is no assumption that the reader is 
familiar with that particular framework, but we pass quickly over the logical 
preliminaries, and refer the reader to that paper for more detailed exposition. 

We now give a brief outline of the paper. Sections [3] and [4] culminate in 
a proof of the dense hypergraph removal lemma, giving an outline of how 
we will prove sparse hypergraph removal. The reader interested primarily 
in the formal framework for handling sparse hypergraphs may wish to read 
Section [2] (which simply introduces notational conventions we use throughout 
the paper) and then skip to Section [5l where we actually introduce the 
framework. 

Section[3]introduces the cr-algebras Bv,x, which contain those sets of tuples 
which can be defined (approximately) using only certain restricted sets of 
coordinates. (For instance, the "cut-norm" used in the theory of graph 
limits is closely related to the norm of the projection of a function onto 
the simplest non-trivial example, Bv,i-) Proofs of hypergraph removal are 
typically divided into a regularity lemma and a counting lemma; in Section U] 
we define the notion of measure having regularity — an analog of satisfying an 
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infinitary analog of the regularity lemma — and then prove a counting lemma. 
We then show that the measures corresponding to dense hypergraphs have 
regularity, giving a proof of the ordinary (dense) hypergraph removal lemma. 
This forms the outline of our approach, and we spend the rest of the paper 
showing that certain sparse measures also have regularity. 

Section [5] finally introduces our formal framework: we define the notion 
of a canonical family of measures, a collection of measures having certain 
joint properties, and show that the ultraproduct of sufficiently random fi- 
nite graphs gives us such a family. Section [6] then introduces the Gowers 
uniformity seminorms and begins the project of showing that, under suit- 
able conditions, a function has positive uniformity seminorm exactly when 
it correlates with certain u-algebras. Finally, in Section [7] we complete the 
proof of this relationship for canonical families of measures, and show that 

We are grateful to Ehud Hrushovski for providing the crucial motivating 
idea, and to Isaac Goldbring for many helpful discussions on this topic. 

2. Notation 

Throughout this paper we use a slightly unconventional notation for tuples 
which is particularly conducive to our arguments. When V is a finite set, a 
V -tuple from G is a function x : V ^ G. If for each v G V we have designated 
an element Xy € G, we write xy for the tuple xv{v) = Xy. Conversely, if we 
have specified a F-tuple xy, we often write Xy for xv{v). When V,W are 
disjoint sets, we write xy U x\y for the corresponding V U T^-tuple. When 
/ C F and xy is a given V-tuple, we write xj for the corresponding /-tuple: 
xi{i) = xy{i) for i G /. We write 0^ for the tuple which is constantly equal 
to 0. (This is the only constant tuple we will explicitly refer to.) 

One of the key tools in this paper will be the use of the collection of 
definable sets in a model of first-order logic. We will refer to our models 
as 9Jt, and to the corresponding universes of these models as M, re- 
spectively. We will refer to formal variables in the language of first-order 
logic with the letter w, reserving the letters x, y and so on for elements of 
models (for instance, when integrating over a model). We will often refer to 
fixed elements of a model (used as constants or parameters) with the letters 
o, 6, c. In keeping with our tuple notation, we will often refer to finite sets 
of variables as wy^ww-, etc.. 

Recall that when is a formula with free variables wy^ 9JT is a model of 
first-order logic, and xy G M^, we write 971 ^ '^{xy) to indicate that the 
formula holds when we interpret each free variable by the element x^. 
A set S C is definable if B = {xy \ 971 1= ip{xy)} for some formula 
(f. When the model 97t is clear from context, we will often equate formulas 
with the sets they define — for instance, if S is a definable set, we will also 
consider B to be the formula defining this set, so by abuse of notation, 
B = {xy I 971 1= B{xy)}. If a set has multiple groups of parameters — 
say, B C M'^^^ — we will write B{aw) for the slice {xy \ aw ^ xy G B} 
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corresponding to those coordinates. We say B is definable from parameters 
ii B = C{aw) for some definable set C. 

Similarly, when / is a simple function built from sets definable from pa- 
rameters, so / = Y2i ^iXC'i where each Oj is rational and each Cj is definable 
from parameters, we sometimes view / as being a "rational linear combina- 
tion" of formulas, and refer to the union of the parameters defining all the 
sets Ci as the parameters of /. 

3. ct-Algebras 
Models come equipped with certain natural a-algebras. 

Definition 3.1. Let 9Jt be a model and let V he a finite set of indices. We 
define By to be the Boolean algebra generated of subsets of definable 
from parameters. 

For I QV, we define Byj to be the Boolean algebra generated by subsets 
of M" of the form 

{xv \xi £B} 

where B e Bj. 

If I C V{V) then we write By^ for the Boolean algebra generated by 
IJj^x^vx- When k < \V\, we define Byj^ to be the Boolean algebra 
v,{/cy||/|=fc}- 

For any I Q V, we write <I for the set of proper subsets of /. The 
principal algebras are those of the form Bv,<v = By^\y\_i. 

In all cases, we drop the superscript ^ to indicate the cr-algebra generated 
by the algebra. 

The algebras Byj- are generally uncountable, and so the corresponding 
cr-algebras Bv,x are generally non-separable. It is possible to recover separa- 
bility by allowing only formulas whose parameters come from an elementary 
submodel. (This causes some additional complications, since the slices of 
some set A C are no longer necessarily measurable; rather, the slices 
are measurable with respect to some slightly larger cj-algebra which depends 
on the choice of slice. These complications can be addressed by a small 
amount of additional model-theoretic work; this separable approach is used 
in [T3|l34].) These a- algebras are closely related to the Szemeredi Regularity 
Lemma; for instance, in [13] it is shown that the usual regularity lemma fol- 
lows almost immediately from the existence of the projection of a set onto 

^{i,2},i- 

Note that, while a cr-algebra is well-defined independently of the choice of 
a particular measure, notions like the projection onto a cr-algebra do depend 
on a particular choice of measure. 

The simplest interesting case of these algebras is B^i 2},1j which is a cr- 
algebra on consisting of sets of pairs which can be "defined one coordi- 
nate at a time" — B^i^2},i is generated by sets of the form 

{{x,y)\m^ip{x)^l;{y)}. 
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(The first introduction of these algebras is that we know of is in [32] . 
where Tao aheady notes the relationship with the Gowers uniformity norms 
which we will discuss in detail below. The work in this paper builds on 
further developments in [STIISS] .) 

There is some flexibility in the choice of the set X; for instance, ^{1,2, 3}, {{1,2}} = 
'^{1,2,3},{{1,2},{1}} (since {1, 2} G Z, we may already use formulas which refer 
only to the coordinate 1, so adding {1} does nothing). This leads to two 
canonical choices for X: a minimal choice with only the sets of coordinates 
absolutely necessary, or a maximal choice which adds every set of coordi- 
nates allowed without changing the meaning. Depending on the situation, 
we want one or the other canonical form. 

Lemma 3.2. // for every I £ I there is an I' € I' with I Q I' then 

Proof. It suffices to show that if / C /' then Byj C Byj,. But this is easily 
seen from the definition, since a formula containing only the variables wj 
certainly contains only the variables wji. □ 

Corollary 3.3. For any V,I, there exist Iq,Ii such that: 

(1) Bv,x = Bv,io = 13v,i„ 

(2) Iq is downwards closed: if I GZq and J ^ I then J € Xq, 

(3) If I, J £li then J ^ I. 

Definition 3.4. Given I,J^C V^V), we define I A J' to consist of those 
sets K such that there is an / G Z and a J G J" such that C / n J. 

4. Hypergraph Removal 

In this section, we present a proof of the ordinary hypergraph removal 
theorem, essentially the one given in [34], which is in turn based on the 
arguments in [30|[3Tj- We first state a necessary property on measures, and 
prove a lemma reminiscent of the hypergraph counting lemma. 

Definition 4.1. Let be a probability measure on By- We say has 

J -regularity for J C y if: 

Suppose X C V{V) and for each / € I, / fl J C /. For each 
/ G X, let // G L'^{Bv,i). Then if 5 G L~(i3y,j), 

l{g-n9\Bv,XAj))X{fi du"" = Q. 

lex 

When J = this is trivial. 

Theorem 4.2. Suppose has J -regularity for all J QV with \J\ < k, that 
k < \V\, X C and for each I £ I we have a set Aj G Byj- Further, 
suppose there is a 6 > such that whenever Bj G Byj and {Ai \Bj) < S 
for all / G X, H/ex-^/ ^-^ non-empty. Then [f^lj^j- ^l) > 0- 



6 HENRY TOWSNER 

Proof. We proceed by main induction on k. When k = 1, the claim is 
triviah we must have {Ai) > for all J, since otherwise we could take 
Bi = 0; then {{^ Aj) = H {^l) > 0. So we assume that k> I and that 
whenever Bi G Bv,i and ^^{Aj \ Bj) < 6 for all /, Pl/ex-^^ non-empty. 
Throughout this proof, the variables / and Iq range over I. 

Claim 1. For any Iq, there is an A'j^ € 'Bv,</q such that: 

• Whenever Bj € B^rj for each /, u^{Aj \ Bj) < 5 for each I ^ Iq, 

and v^{A'j^ \ Bj^) < 5, H/ex-^^ non-empty, and 
. If u^iA'j^ n Oi^i, Aj) > 0, u^iHi^iAj) > 0. 

Proof. Define A'j^ := {xj, \ E(xa,„ | Bv,<Io){xio) > 0}. If u^iA'j^ n 
r\i:jtio > then we have 



/ 



E(xA,„ I Bv,<i,) n XA.du^ > 



and since has lo-regularity, this implies that i'^{f]Ai) > 0. 

Suppose that for each I, Bj G Byj with u^{Ai \ Bj) < 5 ior I ^ Iq and 

v^{A'i^ \ Bi) < S. Since 

i^^'iAiM'io) = I XAj^il-XA'.Jdi^'' = J HXA,^ I Bv,<io){l-XA',^)di^'' = 0, 
we have (^/„ \ Bj^ ) < S as well, and therefore Pl/ex non-empty. H 

By applying the previous claim to each I G X, we may assume for the rest 
of the proof that for each /, Aj G Bv,<i. 

Fix some finite algebra B C Byj._^ so that for every I, \\xAi — ^{XAj \ 

'^)IIl2(i/^) < ^(jx|_|_i) (such a B exists because there are finitely many / and 
each Aj is ;By,jt_i-measurable). For each I, set A"^ = {aj \ E{xAi \ B){aj) > 

Claim 2. For each /, (Aj \ A*j) < 5/2 

Proof. Aj \ A*^ is the set of points such that XAj — '^{XAi \ B){a) > jx\+i' 
By Chebyshev's inequality, the measure of this set is at most 

m + ^fj iXA-HXA, I B)fdu'' = {\I\ + lf\\xA-EiXA, I < |. 



Claim 3. u^iCij Aj) > i^V{f]j A})/ + 1). 
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Proof. For each Iq, 



lex 



But then 



lei lei lo i^io ' ' /e|x| 



Each A*^ may be written in the form Ui<rj ^7 i where • = H jg( ^ ) ^/ i j 
and v4Jjj is an element of Byj. We may assume that if i 7^ i' then 

We have 

-^(n^i)=-^( u n n ^*.^.^)- 

For each i e n/[l>?7], let = n/nje(/J ^l„J,/- Each .^^^ is an ele- 
ment of By J, SO we may group the components and write = fljg^ ^ j 

whereL».^ = n73j^7,i,,j- 

Suppose, for a contradiction, that i^^(P)j74j) = 0. Then for every i G 
H/ili J'/], ^^{^i) = '^^(flj j) = 0- the inductive hypothesis, for each 
7 > 0, there is a collection -B^ j € j such that {D-^ j \B-^ j) < 7 and 
n 7 -B-:* f = 0. In particular, this holds with 7 = , . . ^ ^ , -. 

^ i,J ^ ' 2(^*J(njr-/){maxjr-j) 

For each I,i < rj, J C I, define 



Bh,j = A:},,jn n 



l,ll=l 



Claim 4. i^^U*. ,• t\B* j)< , ^ -. 

Proof. Observe that if x G A*j j\Bj ^ j then for some i with ij = i, x ^ 
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So 



2(fe-i)(maxj'^j) 



Define i?| = U<., fl j i^^- 
Claim 5. i^^{Ai\B*) < d. 



Proof. Since v^iAj \ A}) < 5/2, it suffices to show that i^^ {A} \ B*j) < 5/2 



-^(^1 \ u n ^h,j) = u n \ u n 



i<rj \ J J / 

i<ri J 

k \ 5 



< n ■ 



<5/2 



Claim 6. 



J- 



Proof. Suppose x e f]j Bj = f]j Ui<rj fl j ^7,j, j- Then for each /, there is 
an ij < rj such that x G f}j Bj j. Since Bj j C Aj j, for each / and 
J Cl,xe A*j -^ j. 

For any J, let / D J. Then 



In particular, x € 



B.j\J I J ^* ■ 



Bj^j U U/oj/Y^^^^' * ' ^-^ particular i wc have 

chosen. Since x £ A*t ■ , for each /' D J, it must be that x £ B^ This 
holds for any J, so x G f^j B^^ j. H 

From our assumption, P|j i?j is non-empty, and therefore there is some 
i such that f^jB^j-j. But this leads to a contradiction, so it must be 
that i'^{f]jA*j) > 0, and therefore, as we have shown, u^{f]j^j-Ai) > 



□ 
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In order to prove the hypergraph removal theorem, we would then hope 
to argue as follows: the failure of hypergraph removal implies the existence 
of a family of counterexamples of unbounded size. Wc could then take the 
ultraproduct of these counterexamples to obtain an infinite model in which 
i^^iClj^^v^ Aj) = for a family of sets Aj corresponding to the graph we 

are trying to remove. By the previous theorem, we would have an arbitrar- 
ily small family of definable sets Bj, and we would then argue that that 
these sets correspond to sets in the finite models whose removal causes the 
removal of all copies of the hypergraph. The only remaining difficulty in 
this argument is showing that the measure we obtain has J-regularity for 
ah J CV. 

In the remainder of this section, we carry out the proof for the dense case. 

Definition 4.3. Let ip{wv,ww) be a formula with the displayed free vari- 
ables. Then there is a corresponding function A^ : — >■ By given by 
A^{xw) = {xy I 9^ 1= vi^VjXw)}- Let be a probability measure on By- 
For any ip, define z/^ : — > [0, 1] by ly^^xw) = i^{A^{xw))- We say u is an 
definable Keisler probability measure if for every formula ip with parameters, 
is continuous with respect to the topology generated by B^r- 

Lemma 4.4. Suppose that for each J C. V, is a definable Keisler prob- 
ability measure on Bj such that for any L'^{u^) function f, J f du^ = 
JJ f dv^dv^^^ . Then has J-regularity for every J CV. 

Proof. We have 

[{g-E{g\Bv,iAj))llfid'''' 
I 

= J{g-E{g\ Bv,XAj)) J] fi du' dv^'^' . 



For each av\j, the function Wi fi{ai\^j,xif^j) is measurable with respect 
to Bv<j, so we have 



j{g-n9\Bv,XAj))]\fidv^ = Q. 
I 



Since this holds for every ay\j, the claim follows by integrating over all 
choices of ay\j. □ 

Definition 4.5. Let i^, A be fc-uniform hypergraphs on vertex sets V{K)^ ^(^) 
respectively, tt : V{K) — >■ V{A) is a homomorphism if whenever e G K, 
7r"e G A. (That is, tt maps edges to edges.) hom{K,A) is the number of 
distinct homomorphisms from K to A. If K, A are /c-uniform hypergraphs, 
we write 

hom{K, A) 



d{K, A) = 



\V{A)\\v(K)\- 
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Theorem 4.6. For every k-uniform hypergraph K and constant e > 0, there 
is a 5 such that whenever A is a finite k-uniform hypergraph with d{K, A) < 
6, there is a subset L of A with \C\ < such that hom{K,A\C) = 0. 

Proof. Suppose not. Let i^, e be a counterexample, and since there is no such 
5, for each n we may choose a fc-uniform hypergraph A^ with d{K^ A^) < 1/n 
such that there is no such subset L of Clearly |y(74")| oo. We 
view each A^ as a model, with M" = V{A'^) the set of points, A" a fc-ary 
relation V{A'^), and predicates making the normalized counting measure 
on V{A"')'^ definable for each J C V{K). In particular, this means the 
counting measure is a uniformly definable Keisler probability measure. 

Let V = V{K). For each / G ivT, let A"} = {xy \ xi G ^"}. Note that 
hom{K, A'^) consists exactly of the elements of 

I&K 

and therefore d{K, A"-) = i^niClieK ^/)- particular, we have i^XiClieK ~^ 
0. 

Now take an ultraproduct of the models (M"',A",...) to obtain 9?T = 
{M,A, . . .). (See [12] for the construction and, in particular, the demonstra- 
tion that the measures defined by , the ultraproduct of the u^^, extend 
to probability measures on Bj.) By [ITIIIS] (or see the next section), the 
decomposition in the statement of Lemma 14.41 holds in DJl, and therefore 
has /-regularity for all / C y. We have I'^iCljfzK ^i) = 0) ^■^d therefore by 
the previous theorem, there are Bj G Byj such that {E \ Bj) < and 

p^g^ Bj = 0. Let C = \Jj{Ai\Bi), so i/[i'^](C) < e. C is definable from 
parameters in M, and therefore 

f]{Aj\C) = $ 

I&K 

is a formula, which is therefore satisfied by the corresponding set in almost 
every (M", yl", . . .). Let C" be the set defined in the model (M", A^,...) 
by the formula defining C. Then there is some sufficiently large n such that 
v^{Cn) < e but C\i(^k(^1 \ ^ri) = 0, contradicting the assumption. □ 

Our goal is to obtain the same result when A is not a dense hypergraph, 
but rather a dense subset of a sparse random graph. The main idea is that 
we will replace with a measure concentrating on the sparse random graph; 
however this will not satisfy the easy Fubini decomposition we used for the 
dense case, so we will need to use the randomness — plus a large amount 
of additional machinery — to prove that the resulting measures nonetheless 
have regularity. 

5. Families of Measures 

To motivate our construction, we first consider the situation in large finite 
graphs. Suppose we have a large finite set of vertices G and a sparse random 
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graph r on G. There are two natural measures we might consider on subsets 
of G^: the usual normalized counting measure 

KS) = ^ 

and the counting measure normalized by F: 

When we consider subsets of G^, we have even more choices; we could nor- 
malize with respect to all possible triangles 

'^o('S') = i^jL' 

or only those triangles entirely in T 

. .c-x |{(x, y,z) € S \ (.T, y) £ r, {x, z) g T, {y, z) £ r}| 
\{{x,y,z)\{x,y)eT,{x,z)eT,{y,z)£T}\ ' 
or only those triangles where certain specified edges belong to T: 

^^^^-j ^ \{{x,y,z) eS\{x,y) £ V,{x.,z) £ r}| 
\{{x,y,z) I {x,y) € T, {x,z) G r}| 

Indeed, further consideration suggests that we have multiple choices for mea- 
sures even on subsets of G: in addition to the normalized counting measure, 
we could fix any element x E G and define 

\{yes\{x,y)er}\ 

\{y\ix,y)eT}\ ■ 

When r is a A;-uniform hypergraph, we have yet more possibilities. 

We therefore introduce a general notation for referring to all such mea- 
sures. We first describe this notation in the setting of a large finite graph, 
but we will primarily use it in the infinitary setting. Wc assume that a value 
for k and a fe-uniform hypergraph T have been fixed. We let y be a finite 
set of indices, and we describe a family of measures on G^: let W he a set 
disjoint from V, let xw G G^ , and let £^ be a fc-uniform hypergraph on 
VUW. Define 

r^x^ = {XV G I Ve € xe G T}. 

Note the significance of our notation for tuples here: fc-tuple which 

may consist both of elements from the fixed set x\y and from xy- E specifies 
which sets of k indices from FUW^ are required to belong to V. For instance, 
in the case where A; = 2, so F is a graph, F|^j^2)} = while F^ ^^-^^ = G^. 



We then define 

',Xw 



\rv I 1^ E,xw 



if|r^,.^l>o 



if|rL^I = o 
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For instance, in the measures above, A = jj]^^^ , A' = /u|||^2)} 0' = A*0^0^'^^) -^i 

/^{(Sk3),(2,3)},0'^2 = m{Smi,3)},0' and A, = /x|J}^2)},x- When W and xw 
are clear from context, we just write ii\ for IJ^^^, and call xw the 6ac/c- 
ground parameters of 

When integrating over fi^ , we always assume the variable being inte- 
grated is XV- 

A key feature of this notation is that it makes it easy to specify the Fubini- 
type properties that we would like these measures to satisfy. If F = Vb U Fi 
where Vq n Vi = and E' is the restriction of E to vertices from VoUW, we 
intend to have 



Vo 

E',xw' 



To avoid having to endlessly specify the restriction of E to the appropriate 
vertices, we will generally allow E to have extra edges not included in the 
vertex set V; for instance, we will not distinguish between ji^, and Mb 
and will usually write 



J ■dlJ^E,xwM = JJ ■dlJ^\xv^uxw'^f^E° 



"'f^E,xw ' 

even though E is not a subset of (^'^'^) • 

In our infinitary setting, we no longer have the underlying counting mea- 
sures to refer to, so we will have to define formally the properties we want 
a family of measures to have. 

We will use the meta-variable fi for a family of probability measures — 
technically, a function from appropriate finite sets to probability measures, 
so when /i is a family of probability measures, /Ug is an actual probability 
measure for suitable values of V, E, xw- 

Definition 5.1. Let 9Jt be a model. A weakly canonical family of probability 
measures of degree k and size d, fi, consists of, for any sets V, W with 
y n = 0, any /c-uniform hypergraph EonVUW with \E\ < d, and 
any xw G , a probability measure /J-^ such that: 

(1) For iLtg'-almost-every xw, xw ^ definable Keisler probability 

measure, 

(2) If no edges in E contain both w and an element of V then uX, ^ = 

h^E,Xw' 

(3) If TT : Vb U Wo — )■ Fi U Wi is a bijection mapping Vq to Vi and Wq to 
Wi and Tr(Eo) = Ei then ^ = //^i 

We say is a a canonical family of probability measures if additionally 

When F = Vb U Fi, Vb n Vi = 0, for /i^-almost every xw 
these measures satisfy the Fubini properties 



/ ■dfiE,xw = JJ ■dfJ'E,xy,Uxw'^f^. 



Vo 

E,xw ' 
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Weak canonicity merely enforces a certain amount of uniformity on these 
measures — the parameter Xw only matters if there is an edge in E connecting 
w to V, and the measures don't depend on the particular choice of indices 
used. The Fubini condition is non-trivial, and it is ensuring this property 
that requires us to work only with sufficiently random sparse graphs. 

We wish to work in models which have two additional features: first, the 
model actually includes formulas defining the all of the measures in the fam- 
ily /i. Second, the model contains extra function symbols max which pick 
out values maximizing certain integrals. (The construction of such languages 
has appeared a few times (see [l6l[35]; a general theory of constructions of 
this kind is given in [13]).) 

Definition 5.2. Let £ be a language of first-order logic containing a k- 
ary relation symbol 7, and let d be given. C'^''^ is the smallest language 
containing C such that: 

• Whenever ip{wv,ww,wp) is a formula with the displayed free vari- 
ables, is a set disjoint from V, E is a /c-uniform hypergraph on 
V UW with \E\ < d, and q € [0, 1] is rational, there are formulas 

and 

with free variables ww-,wp-, and 

• Whenever E' is a /c-uniform hypergraph with < d edges on a vertex 
set y, y = Vb U Vi is a partition oi V , W and P are finite sets 
with V, W, P pairwise disjoint, / is a rational linear combination 
of formulas with free variables wwiWy, and ip{ww -.wp^wy) is a 
formula with the displayed free variables, for each p ^ P there is a 
function symbol maXj^'^°''^''^(t(;iy, Wo)- 

Note that the formulas m\ < q.ip and < q.(p bind the variables 

wy- We will "abbreviate" these formulas as w^(v') ^ Q and w^^^^ ^ 
q respectively. We will abbreviate -<m^ {(p) < qhy ((/?) > q and 

< ^bym^ ,„^(99) > q. We view {maxp '^"'^''^(wh/, wp, u;yo)}peP 
as a tuple ma-x.^'^^'^'"^ {ww tWVo) of function symbols. 

Definition 5.3. If 9Jt is a finite model of C and T = 7^ is the interpretation 
of 7 in this model, we extend OJt to a model OJt'"''^ of C"'''^ by interpreting 

'^^'''^rnl,ajB)<q 

to hold iff 

whenever B is definable from parameters, and similarly for rn^ ^^,{B) < q. 

Suppose we have interpreted the formula ip and all the formulas defining 
the simple function /. Let B be the set defined by Lp. For each aw € 
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,xvo G M^°, we choose {ma.Xp'^°'-^''^ {aw , xvo))^^^ '^ to be some tuple bp 



maximizing 



Vi 



^0 



Note that we consistently use m to refer to the formula of first-order 
logic describing a measure, and /i to the actual measure corresponding to m. 
Also, note that in the interpretation of ma.-x.p'^''''^'^ [aw , xvq) , B depends on 
aw, XV, and bp, while / depends on only aw and xy- 

We need to ensure we work with sufficiently random sparse hypergraphs: 

Definition 5.4. Let T be a A:-uniform hypergraph on a set of n vertices and 
let V, W be disjoint sets and E C (^^^) . We write Sv,w,E ^ set 
of tuples aw G ^w such that there is some partition V = VoUVi and some 
set A C Fp „ such that 



> 5. 



Let r be a A;-uniform hypergraph on a set of n vertices. We say T is 
6, d- suitably random if whenever V, W are disjoint sets with |1/ U VF| < kd, 

\E\ < d, f^^{£^^w,E) < 

This definition says that "most" xvq € -T^^avK ^^^^ roughly the correct 
number of extensions to elements of . This can be seen as a hypergraph 

generalization of the notion of uniformity used in, for instance, [19] to prove 
versions of the regularity lemma in sparse random graphs. 

Let C be the language consisting of two A;-ary relation symbols, 7 and a. 

Theorem 5.5. Let e > 0. Suppose that for each n, r„ is a 5n,d-suitably 
random k-uniform hypergraph where 6n 0, and let An C r„ be given with 
\^n\ ^ e|rn[. Then each 9Jt„ = (r„,j4„) is a model of C Let lA be an 
ultrafilter on N and let S!Jt be the ultraproduct of the models QJtn""'^. Then 9Jt 
is a model of C^''^ such that: 

(1) SOT 1= o" ijf for U- almost- every n, a. 

(2) There is a canonical family of probability measures of degree k and 
size d, on the a-algebra generated by the definable subsets of 

such that whenever B is definable from parameters, 

f^l^^iB) = mf{q G Q>o [ 9?T N m^^^iB) < q}. 

(3) l^[{,%M)>e. 

(4) Whenever E is a k-uniform hypergraph with < d edges on a vertex 
set V , V = Vq\JVi is a partition of V , W and P are finite sets 
with V, W, P pairwise disjoint, f is a rational linear combination 
of formulas with free variables ww,wv, and ip{ww ,wp,wv) is a 
formula with the displayed free variables, for every aw £ M^,bp € 
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Proof. (1) The first part is the standard Los Theorem for ultraproducts. 
(2) That the measure //^ defined in this way extends to a genuine 
measure on By is the standard Loeb measure construction. Weak 
canonicity holds in each finite model, and therefore there are for- 
mulas holding in each finite model specifying that weak canonicity 
holds. By the Los Theorem, these formulas hold in 9Jt, and therefore 
the measures ^ are weakly canonical. 



Note that the formulas satisfied by rn^ in 9Jt and the actual 
measure almost line up: when B is definable from parameters, 

if l4:,a^ {B) <q then 9Jt h ml^jB) < q, but if 9Jt h m^^„^ (B) < q 



then we can only be sure that //^ (B) < q. 



To see that the measures /Xg are actually canonical, it suffices 
to show that for each B e and /xg'-almost every xw £ 

Suppose not; then for some set B definable from parameters, there 
is a set of xw of positive measure such that this equality fails. It 
follows that for some rational 6 > there is a set Xq of xw of positive 
measure such that 



f^lx^iBixw)) - J f^l\xy,uxjBixw)) d^g.^ 



> 6. 



We need to represent the integral in this definition closely enough 
by a formula to let us define a set of points where this violation 
occurs. Consider the function fxwixVo) = f^^,xvoUxw^B{xw))- We 

have < fxwixvo) < 1- 

(Roughly speaking, the problem is that integrals are not directly 
definable in our language, and there are "different ways" a function 
could have a given integral — say, by having a small number of points 
where the value is large, or a larger number of points where the value 
is smaller. However we will show that there must be a set of positive 
measure where the functions /^jj^ not only all have nearly the same 
integral, but all these integrals can be finitely approximated using 
the same level sets. This will allow us to write down a formula 
defining a set of points of positive measure, and with the property 
that every point satisfying this formula belongs to Xq.) 

We may partition the interval [0, 1] into finitely many intervals 
ij = [Si,Si+i) of size < (5/8 and with rational endpoints. Let us set 
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^i{xw) = {xvo I fxwi^Vo) e li} and 7ri{xw) = M£%j^(nj(xi^)), so 

when xw € -'^o, Ei '^i7ri(x^) < / fxw^lJ^^xw ^ (Jivri(xvi/) + (5/8. 

We choose Xi C of positive measure and, for each i, an interval 
Ji = {rij,r]j) with rational end points such that Tri{xw) G Ji for each 
Xw S Xi and 

i i 

Choose a rational a > very small, and let 

^ ^E,xv,uxjB{xw)) > Si - a}. 

Then Ili{xw) ^ n^l^^H') ^ind n'-(a;vi/) is definable. Let tt'-{xw) = 
f^^x '(^'i(^w))- By choosing a small enough, we may find a set 
X2 ^ ^1 of positive measure so that for xw e X2, each 7r^(a;iy) G Jj 
as well. 

Now we may consider the set 6 of xw such that 

Note that 6 is definable from parameters and C 0. 

Consider any xw £ ©, not necessarily in X2. Since each j;^(n^(a^W")) < 

Vi, 

J l^t^y,yj.jB{xw)) df^%^ < Y Si+iV'i < E ^^^i + V4. 
On the other hand, since each j;^(n^(a^Ty)) > 'Vi, 



/ 



(since we chose a small enough). 
So when xw G O, we have 

i i 

Therefore when xw G X2 C Xon0, we must have either fi^ ^^^{B{xw)) < 
J2i SiVi - S/2 or > X)i '^i'yi + S/2, and therefore 

V Iml^^iBixw)) > Y ^^^^ + V2 I • 
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Let ^ be the conjunction of this formula with the formula defining 0. 
Then we have SDt 1= ipi^w) whenever xw G X2, and therefore 9Jt 1= 

rri^ (ip) > C for some ^ > 0. We also have that whenever 9Jt 1= 'i/j{xw), 

'•E,xw 



XYqUxw 



it is actually true that 
S. 

Since the formula m^(V') > C holds in the ultraproduct, it also 
holds in infinitely many finite models. But any finite model where 
this holds fails to be (, d-suitably random. This contradicts the 
assumption that the finite models are d-suitably random with 

(3) The third requirement follows immediately the Los Theorem: the 
formula m^l(^l]yiA) > e holds in every finite model, and therefore in 

2Jt as well, and therefore > e. 

(4) Fortunately, the integral in this statement does not cause as much 
difficulty, since we do not need to deal with it uniformly in param- 



> e 



eters. Let f = ^aiXd- Whenever UxB{a^,xv^,bp)diJ^E^^^^ 
for some e, there is a formula holding of the parameters aw, xvo, hp 
which is a conjunction of components of the form 

"^eWo (^'C'*^' ^^0) ^ B{aw, bp,xvo)) <q 

or negations of such components, and which implies that the integral 
is > e. But then this formula holds in ZY-almost every finite model. 



which means that we must have 



I fx 



B(ai^ jXvq ,maxp' ''^ (avi,' , a; vj) )) "''^■^'^ 



e in ^-almost every finite model (where ap, etc., refer to the corre- 
sponding parameters in those finite models). But then this formula 



also holds in 9Jt, so 



I fx. 



^B(aw,xvQ 

Tl. Since this holds for every e < 
lows that 

^^B{aw ,xvq ,maxp'^o '■''■'^ {aw ,a: vj, )) ^^^'^Vg 



Vi 



I fXB{aw,xvo,bp)dl^E,xvo 



> e in 
it fol- 



> 



fXB{aw,3:vo,bp)dtJ'E,: 



□ 



6. Uniformity Seminorms 

6.1. Seminorms for Principal Algebras. In this section we define a fam- 
ily of seminorms, the Gowers uniformity seminorms, corresponding to the 
(7-algebras defined above. Fix disjoint sets V, P and a A;-uniform hypergraph 
E C (^^^) ; let m = |E n Q | and let ;U be a canonical family of measures 
of degree k and size Xl/S-E '^P such that the measure /U^^j^, 

and the measures we generate from it below, satisfy the appropriate Fubini 
properties. (We will only work with a finite family of measures, so the set of 
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such ap has //^-measure 1.) To avoid repeating the background parameters 
ap over and over, we will write /Ug as an abbreviation for //^ , 



,ap ■ 



Definition 6.1. For each / C y, we define M^^^ = where 
is given as follows: for each J e E and each co : J (1 1 ^ {0)1}) there 
is an edge J'^ = ( J \ 7) U {{i, uj{i)) \ ie Jnl}. 

H^'^^ is the measure obtained by making a second copy of the indices in 
I. For instance, if /Xg = /^|(|^2)}' measure whose underlying graph has 

two points connected by an edge, ^^^^ is a measure on 4-tuples, where the 
four coordinates are {(1, 0), (1, 1), (2, 0), (2, 1)} and there is an edge between 
each pair (1, b), (2, b'). 

Note that /ig^® = jJ.^- For i & I, b € {0, 1}, we write in place of 
for instance, we write 

f{xv\i,x°j,x])dfi^'^^ 

where the variables being integrated over are exactly the ones displayed. If 
00 : I ^ {0, 1}, we write Xj for the tuple xf(i) = x^^^\ 

Note that we chose the degree of our measure to be YliieE 2'^^^' because 
this is precisely the size needed to ensure Fubini properties for //g"^^. 

Definition 6.2. Let / : — > M be an ;By-measurable function 

with \V\ = n. Define || • by: 



Whenever we refer to the norm ||/||;7V(^^), we assume that / is 
and By-measurable. 

We have to check that these are well-defined. We actually prove the 
following stronger lemma, which will be useful later. 

Lemma 6.3. /// is an L°°(/ig) function and B is Bv,i -measurable for some 

lev then 



< / n f{x^)XB{x^)df.l+'' ^ / n f(^v)df^- 

a;e{0,l}^ a;e{0,l}^ 



v+v 

E 



Proof. It suffices to show the claim in the case when 1/| = \V\ — 1. Since 
/ = fXB + /Xs, we have 



/ n f{x^v)dnV'=j n [UXB){x-v) + UXB){^v)\d^>}i 



u;e{0,l}^' a;e{0,l}^ 
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/ 



Expanding the product gives a sum of 2 terms of the form 

where each Si_j is either xb or X'b- We wih show that each of these terms is 
non-negative. Since /Oweioi}^ f ixv)XB{xv)'^t^^^^ '^^^ these terms, 
both inequahties follow. 

Note that XScji^v) depends only on In particular, if there arc any 
u>,uj' G {0, 1}^ such that oj{i) = for all i € / but ^ S^', then for 

any x^r U x^,, Xsjx'^) = XS^{xf ) = Xsjxf ) ^ XS^,{xf') = XS^,{x'{^)- In 
particular, one of these two values must be 0, so the whole product is 0. 

So we may restrict to the case where S^j depends only on u \ I. Let v 
be the unique element in V \ I and let E' = E \ (^) . Then we have the 
decomposition 

The second equality holds because the graph E^'^^ used to defined the 
measure yU^"*"^ does not contain any edges containing both (f,0) and {v, 1). 

So we have 



n (fxsj{x^)dfil 



v+v 

i+i 



n (fXsJ{x^)df^l^^o^j^idfil^^o^^,dfi^+ 




n fxsAx'i,xl) n fxsAxi,xl)dfil^^o^^^dfil^^o^^^dfi 
io&{o,i}' we{o,i}^ 



i+i 

E' 



Since the inside of the integral is always non-negative, this term is non- 
negative. □ 

In particular, since /Iltjeloi}^ f {xv)dl-i^'^'^ > 0, ll/llfyv,-^^^-) is defined. 
Next we want a Cauchy-Schwarz style inequality for these seminorms: 

Lemma 6.4 (Gowers-Cauchy-Schwarz). Suppose that for each u G {0,1}^, 

is an L'^{fi^) function. Then 

a;e{0,l}^ 



n U^v)df^\ 
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Proof. Fix some v € V, and let / = F \ {v}. Note that we have the 
decomposition 



,v+v 



As above, the second equahty holds because the graph in fi^ does not 
contain any edges containing both {v,0) and (f, 1). For to G {0, l}''^ and 
b G {0, 1}, let us write cjb for the element of {0, 1}^ given by = a;(i) 

if z G / and (a;6)(z) = 6 if z = v. Therefore, using Cauchy-Schwarz, we have: 

2 



v+v 

E 



wG{0,l}^ 



/(/ n « 



Xj 



a,e{0,l}^ / V '^£{0,1}^ / 



- \ n /^0(a;7,X„)d//^^^0u4 / n f^^i^'l^^v)dl2E,x'>Uxj] ^I^E 

< / n fHi)oixv)dfil^'' [ n /(-r/)i(^y)^M 



u;e{0,l}^ 



In particular, applying this repeatedly to each coordinate in V, we have 



v+v 

E 



< n / n u^v)d^i 

we{o,i}^ a;'e{o,i}^ 

u}&{0,l}V 



V+V 
E 



□ 



Corollary 6.5. |/ < 

Proof. In the previous lemma, take fgv = f and /^^ = 1 for a; 7^ 0^. 
Lemma 6.6. || • ||;7V(^v^ is a seminorm. 

Proof. It is easy to see from the definition that ||c/||[7V(ju^) = |c| ■ ||/||[/V(^v 



□ 



To see subadditivity, observe that | expands to a sum of 2^' ' in- 

|2l^l-m 



l2l^l • 



tegrals, eachofwhich, by the previous lemma, is bounded by 
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for a suitable m. In particular, this bound is precisely ( ||/||;7V(^V) + ||5||(7i^(^]^) 
as desired. □ 

In particular, the work above gives: 

Theorem 6.7. If \\E{f \ Bv,<v)\\lH^^^) > ^ ^^^^ H^Ht^^CM^J) > 

Proof. If ||]E(/ I Bv,<v)\\l'2(pV) > then we may find, for each I with 



\V\ — I, Bj £ By I such that 



< 



fYlxBrdfJ^E 



I 



< WfUxBjWuv^f^vy 
I 



By repeatedly applying Lemma |6.3| once to each /, we have 
^<\\fIl^B,\\uVi^.v^<\\f\\uVi^.vy 



□ 

We will obtain the converse, which will show that | Bv,<v)\\L^{py) > 
iff ||/||;7V(^V) > 0, and in particular will enable us to show that fj, has J- 
regularity. 

6.2. Characterization in Product Measures. 

Definition 6.8. We say fi^ is a product measure if no element of E contains 
more than one element of V. 

(Recall that /U^ abbreviates fi^ , so there may be edges in E connecting 
elements of V to elements of P.) We call such measures product measures 
because they are extensions of the ordinary product measure Hijev /^s- 

We are not concerned with the converse in the case where P 7^ 0, so we 
state it only when EC . 

Theorem 6.9. If is a product measure, and |[E(/ | Bv,<v)\\L2(f_iV-^ = 
then =0. 

Proof. This is essentially identical to the argument we gave for regularity 
for ordinary measures. Suppose ||E(/ | Bv,<v)\\L^{^y) = 0. We have 



v+v 

E 



we{o,i}^,w7^o^ 
= // /(^v-) n fix^)df4dt^l 



This last equality holds because fi^ is a product measure, and so the inner 
copy of does not depend on the choice of Xy. 
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Observe that, for every particular value of Xy, riujeio i}^ w^^o^ fi^v) 
-Sy <i/-measurable, and therefore 



we{0,l}^,a>^0^ 

□ 

6.3. Seminorms for Non-Principal Algebras. We will need a more gen- 
eral family of seminorms corresponding to arbitrary algebras of the form 
Bv,i. 

Definition 6.10. For J C F, define 

2-\J\ 



(/ n /( 

V ^e{o,i}^ 



Note that WJWuiy^^^V) = lU llc/^K)- 

We need to generalize to norms C/^-^ where J is a set. A natural choice 
would be to take the product of U^'"^ over all J ^ J ^ but this is not a 
seminorm. Instead we need the following form: 

Definition 6.11. Let J C V{y) be a set such that if J, J' & J are distinct 
then J % J' . Then we define 



i<k \jej / 



where the infimum is taken over all sequences fo,---,fk such that / = 
Yli<k fi- 
ll is not immediately obvious that U^'^ and U^^'^^ calculate the same 
value, but this will follow immediately once we show that U^"^ is a seminorm. 

Lemma 6.12. /// is an L°°{fi^) function then 

Proof. Let V = V\J. We have 

/ n .f{xv',x^j)df^l+' = II n f{xv',x-j)df^i+idn^E 



Vy, 

a;e{0,l}'' " " we{o,iy 



I 



■■V 

> 0. 



□ 
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Proof. First consider the case where J" is a singleton {J}. Again, let V 
V\J. 





2lJ| 







< 



< 



v 



dl^E 



2lJ| 



For the general case, first observe that, setting c = 

2lJ| 

n 

Jej 



I 



fdfiE 



fdfiE 



So if / = Y,i<k fi we have 



ffd^^^E <E / 

i<k 



f^d^i 



□ 



sE n 

i<k \JeJ 

This holds for any J2i<kfi' so \ J fdfx^l < WfWu^^d^V) 

Lemma 6.14. 11 • ILv.j-. v\ is a seminorm. 

'-'oo yi^Ei 

Proof. Once again positive homogeneity is obvious from the definition, so 

we need only check that the triangle inequality holds. 
We first consider the case where ^ is a singleton: 



< 



jwf 
jwi 



+ 9 



21-^1 2\-'\ 



< 



\2\J^ 



For \J\ > 1, we may use the fact that if / = X)j<fc fi and g = ^j<^gj 
then f + g = E^fe h + E j<m 9j ■ □ 



24 HENRY TOWSNER 

The main thing that makes the uniformity seminorms useful to us is that 
they easily pass across different measures: 

Lemma 6.15. Let J <^ V and V' = V \ J . If (^J) = then for 

ji^ -almost-every xv , WfWuJ (u-' ) = 0- 

Proof. 

= 11/11 

n /(^j)^/^^ 

a;e{0,l}-' 



•' a;e{0,l}-' •' 



J+J 



Therefore for /Lt^'-almost-every xy, \\f\\uJ {^J. ) ~ ^- '-' 

In order to associate these more general seminorms with the correct alge- 
bras, we introduce the following definition: 

Definition 6.16. If X C V^V) is non-empty, we define X-*- to be the set of 

J C.V such that: 

(1) There is no I G I with J Q I, 

(2) If J' C J then there is an J G T with J' C I. 

J- = {ICV\J^I}. 

and depend on the choice of the ambient set V. Note that X"*- always 
has the property that if J, J' G X"*- are distinct then J ^ J', so \ \-\\ vi^ , v. 

Uoi (Mis) 

is defined. Also, if X = {/ C y | |/| = \V\ - 1} then X^ = {V}. Also, note 
that {J-y = {J}. 

We will show that when ji^ is nice enough, Bv,x and njei-"- ^v,J- agree 
up to measure 0. 

Lemma 6.17. // there is no J & J such that J Q I and B is Byj- 
measurable then 

Proof. It suffices to show this for J' a singleton {J}. Write V = V \ J. 
Observe that for any fixed xy, the function XBixy, ") is a ^Bj^jnj-measurable 
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function, where J PI / must be a proper subset of J. So we have: 



v 

E 



□ 



Theorem 6.18. //||E(/ | Bv,i)\\l2(^v) > then ||/|l ^^.ix, ^, > 

Proof. If 1|E(/ I 'Sv,x)|Il2(^v;) > then we may find, for each I G 
Bj £ Sv,i, such that 



< 



fYlxBidfx^ 



< \\fIlxBA\rrV,T^,.,vv 



Observe that for each / E I we may apply the previous lemma, so we have 

ifj^E) Uoo (Me) 

□ 

Definition 6.19. Let /i be a canonical family of measures of size k and 
degree YlieE 2'^^^^'. For some I C 'P{V), we say t/^'"^ (Me) is characteristic 
if for each /, ||/||^v,i-l^^v-) > implies |[E(/ | Sv,x)||^2(^V) > 0. 

Lemma 6.20. // U^'^^fi'^) is characteristic for each I C 'P( J) then /i^ has 
J -regularity. 

Proof. Let J C y and I C T'(y) be given, and let g and // be as in the 
definition of regularity. Let h = g — E,{g \ Bv,x/\,j) = 9 — ^{g \ Sj,xaj) (view- 
ing gf as a function on Bj). Since ||E(/i | ^j,xaj)|Il2(/j^) = Oi by assumption 
we have ||/i|Lj,iaj, J^ = 0. Then we also have ||/ijL.j,iAj, j , = for 

Uoo (Pe) Uoo (^^'E,Xy^^J) 

/i^^'^-almost-every xy^j. (The exact choice of which set of measure 1 this 
holds on depends on the choice of representative of h.j^ 



^ We note the similarity of this argument to the one in [21] . The argument there uses 
two equivalent characterizations of a regularity type property, DISC and PAIR, the former 
analogous to having projection and the latter to having uniformity norm; a key step 
in that DISC implies PAIR in the dense setting, PAIR in the dense setting implies PAIR 
in the sparse setting, and then PAIR in the sparse setting implies DISC. 
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Including xy\^j as part of the background parameters, Theorem 16.181 im- 
phes that ||E(/i | Bjxaj)\\l'2(l.-' \ = 0) and so 



I I 



V\J 
E 



= 

since for almost every fixed xv\j, 11/ ;Bj^xAJ-™easurable. □ 



6.4. Characterization for Non-Principal Seminorms. The principal 
seminorms are the controlling case for showing that the uniformity norms 
are characteristic: in this subsection we show that if the principal algebra of 
a given size is characterized by its uniformity norm then all algebras of the 
same size are characterized by their uniformity norms. We only need this 
for the case of a product measure, but we include the general argument for 
completeness. 

Lemma 6.21. Let T he given and let J V. If /i^ has J -regularity then 
By^XA{J} = Bv;i n Bv,j up to fj,^-measure 0. 

Proof. By definition, we have By^xAiJ} — ^y,'^ Bv,j- 

For the converse, we first show that if g is Sv,j-measurable with '&{g \ 
Byx/\{j}) = 0, also 'E{g \ Bv,x) = 0. Let such a g he given, and for each 
I £ I, let // be ;Sy_/-measurable, so Y\j fj is i3y^i-measurable. Since g = 
g - E(5r I Bv,XA{j}) and /i^ has J-regularity, 

/ g\{fidiil = Q, 

I 

showing that E((7 | By^x) = 0. 

Now let g Bv,x H ;Sy_j-measurable be given. It suffices to show that g' = 
g — E{g I Bv,XA{J}) = 0- But g' is Bv,x H ;By^j-measurable and satisfies 
K{g' I Bv^xA{J}) = 0) and therefore satisfies K(g' \ Bv,x) = g' = 0. □ 

Lemma 6.22. For anyZ, J C V^V^, if has J -regularity for every J £ J 
then Bv,x/\j is Bv,x H Bv,j up to p^ -measure 0. 

Proof. The direction Bv,xaj ^ Bv,x H Bv,j is immediate from the definition. 

For the converse, we may assume J = {Ji, . . . , J^} where i ^ j implies 
Ji 2 Jj, and we proceed by induction on n. When n = 1 this is just the 
previous lemma. Suppose the claim holds for J and we wish to show it for 
JU{J}. Note that 



BxA{JU{J}) = B(XAJ)U{XA{J}) = BxAj W BxA{J}- 
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It suffices to show tliat wlienever / is ;By^x-measurable then E(/ j By jij^jy) 
is yBy^j/^i^j-uiJD-measurable. For any /, we have 

nf\13v,ju{j})=m\Bv,j^Bj) 

= E(/ I Bv,j) + m I Bj) - E(/ I Bv,j n Bj). 

When / is ^v'.x-measurable, E(/ | Bv,j) - E(/ | Bv,j D Bj) is Bv,x n Bv,j- 
measurable, and therefore, by IH, ;By_jAj'-™easurable. By the previous 
lemma, E(/ | Bj) is ;Byi/^|j}.-measurable. In particular, this means E(/ j 
^v,jvj{J}) is BxAj tt) BxA{j} = ^XA(ju{J})-measurabIe. □ 

Lemma 6.23. If n\ has J -regularity for every J G I^, Bv,x is Pljex^ ^V,J- 
up to fi^-measure 0. 

Proof. We have Hjex^ By^j- is 'Sv,Ajgi± /i^-measure (it is easy 

to see that A is associative and commutative, so this follows by repeated 
application of Lemma 16.22 p . We need only check that Ajex^ J — 

If / G I (or even I C I' ^ I) then for every J G X"*-, we have J I, 
and therefore I G J~ , and therefore / G Ajgx^ J~ ■ Conversely, if there is 
no I' £ I such that I I' then there is a J C / such that J G X"*", and 
therefore no J' G J~ such that I Q J' , and therefore / Ajex^ --^ ■ ^ 

Note that the following theorem is one of the places where we directly 
appeal to the definability structure of our c-algebras. This is for a good 
reason: the statement would not be true if we replaced our cr-algebras with, 
say, simple product algebras. 

Theorem 6.24. Suppose that for every J G X"*- and every J' C J, U^{fij^) 
is characteristic and that for ^^'^ -almost-every xy, U^^j."^ ^^^) is charac- 
teristic. Then (/^e) is characteristic. 

Proof. We proceed by main induction on \V\. In particular, if 1/ G X-*- then 
the claim is given by the assumption, so we may assume that every element 

J G X-*- has I J| < \V\, and so by IH, each [/(^"^ (/"e) is characteristic. 

We start with the case where = {J}. Suppose v\ > 0, so, 

setting V' = V \J , also 

Since almost every U^lfij^ is characteristic, there must be a set 5*0 C 
M^' of positive measure such that, for xy G Sq, . j , > 0. By 

assumption, for almost every xy/ G Sq, we have ||E(/ | 'Sj^<j)||i2f„j ) > 
0. This means that for almost every xyi G S'o, we may choose a set 
B{xy< ,h^^xy,) G B^j^^j such that |/ fXB(xyi)<^\^E,xy\ > 0- (^o^e that here 
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even the index set Q^v' may depend on xyi.) There are only countably 
many formulas, so we may assume that there is a single formula defining 
B{xyi ^bq"), independently of xy € 5*0. However there are uncountably 
many choices of parameters, so we may not assume that the parameters 
are independent of xy'- We may choose an e > 0, an approximation of 
/ by a simple function and a set S\ C of positive measure so that for 



I > e- 



E tJ B 

Recall the distinguished function symbols maxg' '•' ' ; these symbols 
choose values of the parameters in ip maximizing the value of | J f'x^^x , b'°y')dt^E 



Xyl I 



So, replacing B{xvi,hQ ) with B{xyi^aw) = B{xyi ^macKQ '■' ' {xy',aw)) 
(where aw are the parameters in the definition of /'), 



J f'XB(xy,,aw)'^l^'k^v' - J ^'^B{xy,,bl 



^^^^'•)df^E,Xy, 



In particular, for each xyi G Si, 



If ,Xyl 



> e. Note that 



B^xyi ,aw) £ ^j<j and therefore i3{aw) € Sj 



J,J- 



We may partition 5*1 = S^US^ where a;y/ G iSj*" exactly when f f'XB(xy,aw)'^^'^^v' — 
e. Clearly at least one of and has measure > fi^ {Si)/2; with- 
out loss of generality, we assume does. Since /' is simple, we have 
/' = Yli<n ^iXCi- We may write a large union D of sets consisting of those 
xy such that 

(jnE^x^,{Ci{xy>,aw)r^B{xv>,aw)) < Pi /\mE^^^XCii^V',aw) B{xv',aw)) > pC) 
A--- 

^ [iT^E,xy,{Cn{xv',aw) riB{xv>,aw)) < Pn A mE .^^XCn{xy',aw) H B{xv',aw)) > 



so that ji^ {D n S^) > (1 — (^)Me {^t) and every element of D satisfies 



/ 



The formula defining this set has only free variables awjXy, so D is Byyr- 
measurable. Then 

/ fXBixy,,aw)^DdlJ.ldfjL = jj f'xB(xy,,aw)^Ddfii^^^,dfJ,l' > e(l-(5)/i^' (5i)/2. 

Since we chose /' to be an arbitrarily close approximation of /, we may 
assume that ||/ — /'||i2(^V) < e(l — (S'i)/4, and so we have 



/ 



fXfi 



B{xy/,aw) 



XDdnldfi > e(l - 5)i4; {Si)/A > 0. 



Since Xb{x , aw)^^ Hy j- -measurable, we are finished. 
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For the general case, suppose | Sv,x)\\l'2(^^v^ = 0. By Lemma 

I6.2U|, fi^ has J-regularity for each J € , so by Lemma \(i.2'6\ By^x = 
Cljex^ ^V,.J- , and so ||E(/ [ Cljex^ ^v,J~)\\L^f,v) = 0. LetZ-^ = {Ji,. . . , JJ. 
Then we may define a sequence /o = /, /j+i = E(/j | Byj- ), 

f = fr + Ur-l - fr) + (/r-2 " fr-l) + ' ' ' + (/o " /l) • 

Smce fr = E(/ | Hjex^ ^V,J-)^ have fr = 0. We therefore have 



i<r \JeX-l- 

For each i <r, fi- fi+i = fi - E{fi \ By j~ ). In particular, ||E(/i - fi+i \ 
^VJ- )\\l2(u]:) = 0, and therefore by the previous part, ||/i-/i+i|Lv,j,+i ^ 
0. But this means the whole sum is 0, and therefore ||/||r7v.j'/ v\ = 0- D 

7. Random Measures 

Theorem 7.1. Suppose fi is a canonical family of measures of size k and 
degree Ylies'^'^^^^ ■ Then U^{fj,^) is characteristic. 

Proof. We show a more general result: 

Let I C y be given and let V' = V\I. Let 

F{xv',x%x])= Y\ fUxv',x'f) 

iu(^{0,l}' 

be such that 



< y F{xv',x°j,x]) 



Then||E(/o. I ^y,<y) I Il2(^V)>0. 

The main result is then the case where I = V and ftiui = f for all w. 

We proceed by induction on |/|. When 1 = 0, this is trivial, so assume 
|/| > 0. 

Fix some u € /, and let I' = I \ {v}. For each co G {0, 1}^ and each 
b S {0, 1} we will write cob for the corresponding elements of {0, 1}^. We 
define a function 



G{xv',xp,x],) = j f^i{xv',xp,xl)dn 



V 

E,Xy,Ux°,Ux^j, ■ 



Let W be the vertex set of the measure /x^^^^^^^ ; that is, W = V U {I' x 
{0, 1}). Let E' be the corresponding hypergraph, so fi^"^^^^^^ = fi^,. Each 
edge I oi E corresponds to at most 21^^^^! edges of E\ so Y1i<^e' 21-^'^^! < 
S/e-B^^'^'- (There is likely some room here for optimizing the exact degree 
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of the canonical family needed.) Let J C V{W) be the collection of subsets 
of the form 

V'u{{i,uj{i)) \ 

for some u € {0,1}^'. That is, J consists of those sets which contain V' 
together with exactly one copy of each coordinate from /'. The elements of 
J'-^ are pairs J = {{i, 0), (i, 1)} for some i G /'. No edge of E' contains both 
elements of one of the pairs {{i,0), {i, 1)}, so fi'^, and fij^, xy^,^j, product 

measures, and in particular, U^{fij^,) and U^lfij^, ^^_^^) are characteristic 
by Theorem 16. 9i 

We claim that G is i3vy,j7-measurable (with respect to the measure /U^). 
Suppose H is a function with ||IE(f/' | Sw,j)\\l2{^w^'j = 0. By Theorem 16.241 

U^"'^ (fj,^,) is characteristic, so \\H\\ w,j^f w\ ~ 0' ^'^'^ therefore for 

almost-every x^, \\H\\ ^ =0, and so \\E{H \ Bw,j)\\L^iaW , ) = 0. 

Then 

.W 



J H{xv',x%,x]i) ■ G{xv' , x% , x]i)dn^. 

H Yl f-^l(.^V',x'f,,xl)d^il^^^^^^o^^^,d^^% 



de{o,i} 
=0 

Since this holds for any H with | Bw,j)\\l2(^^w^'^ = 0, it follows that 

G is jBvK,j'-iiieasurable. This means that we may write 

G{xv',X^j,,Xp) = lim TT gcj,n,N{xv',x'^,) 

in the L^(^^)-norm. We may assume the guj,n,N are L°°{^^,) functions. 
Then we have some e such that 



^<^< j n fUxv',x'f)dn 

= n fiooixv',xp,x'^) Y[ fujiixv',xp,xl)dn 
= Yi f^oixv',xp,x^)G{xv',Xp,Xp)dn^'^^' 



v+i 

E 
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Choosing large enough, we may make 

\\G{xv',X^p,x],) - ^ Y[ gLU,n,N{xV',Xp)\\L2^f,w-^ 
n<Afaje{0,l}^' 

e 

2 Ucoe {0,1}^' 1 1 -^^0 V" , , ) 1 1 ^^v+i' ^ 
and therefore 
0<e/2< / J] f^oixv 1 Xj , x„ 

) X] n 9cj,n,N{xv',xf>)dn'^ 
a;e{0,l}^' n<N u;e{0,iy' 

= X] / n ^o(a;y',Xj,x°)5f^,n,Af(a;y',x';',)(i//g+^' 

".^ uie{o,iy' 



In particular, there must be some n such that 

0< / ]J /a;o(a;y',a;j,x°)5ra;,n,Af(a;y',Xj,)d/i^+'^'. 
c^e{o,i}^' 

Consider the functions given by, for each oj € {0, 1}^ , setting = fujoguj,n,N- 
We apply IH to I', and conclude that ||E(/^^, | Bv,<v)\\l^[^v-^ > 0. Since 
9o'' ,n,N '^^^vy'vjr ^ -Bv,<v-measurable, it follows that ||IE(/oi | ^v,<v)\\l'^{^v-^ > 
as well. □ 

We can now give a sparse version of the hypergraph removal lemma: 

Theorem 7.2. For every k-uniform hypergraph K on vertices V and every 
constant e > 0, there are 5, ( so that whenever T is a C,\K\2^^ -suitably 
random k-uniform hypergraph and An C r„ with ^°™(-^^") ^ ^ then there 
is a subset L of A with \L\ < e\T\ such that hom{K,A \L) = 0. 

Proof. Suppose not. Let e be a counterexample. Since there are no such 
S, for each n we may choose A;-uniform hypergraphs ^ r„ with r„ 
1/n, |-fC|2^'^-suitably random and ^°™(^'-^) < l/n. Let be the model 



given by Theorem 15.51 

Let V be the set of vertices K. For any J C y, Theorem 17.11 implies that 
^ooit^K:) is characteristic, and therefore by Lemma r6.20l has J-regularity. 
For each I & K, let Ai = {xy \ xi € A}. Since ^""l^'^'v^"-* — )■ 0, we have 
Aj) = 0. Then by Theorem 14.21 there must be definable sets Bj 
such that \ Bj) < e/\K\ and fl/eJ^ = ^- ^ = y^iaK^^i \ ^i)^ 

so < e. C is definable from parameters in M, and therefore the 

properties Ai|\'^}(C') < e and H/Gi^l^/ \C) = % are witnessed by formulas. 
Therefore there must be arbitrarily large finite models where these formulas 
are satisfied. But this contradicts the choice of the hypergraphs Hn, r„. □ 
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8. Conclusion 

The notion of suitable randomness used in this paper is strong compared 
to other notions of pseudorandomness for hypergraphs that have been con- 
sidered [3ll4ll6|l22] . The next step towards developing a rich analytic approach 
to working with sparse random hypergraphs would be a detailed investiga- 
tion of the relationship between notions of pseudorandomness in the finite 
setting and the corresponding properties of measures in the infinitary set- 
ting. With less than suitable randomness, we would expect to lose the full 
Fubini property, but the notions that replace it are likely to be of interest 
themselves. 

The approach Conlon and Gowers use to prove hypergraph regularity [5] 
depends, like our approach, on the use of various norms to detect the pres- 
ence of certain properties. Their norms are much more narrowly tailored 
than the general uniformity norms. The uniformity norms are strikingly 
natural in the infinitary setting, lining up with canonical algebras of defin- 
able sets; it is possible that other norms also correspond to algebras which 
might be of independent interest. 
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